Transmitter for enabling switching involving a 3D video signal

ABSTRACT

A transmitter according to the present invention includes a decoder that decodes a video signal received from outside and acquires identification information including a format concerning 2D or 3D of video from a layer corresponding to each frame of the video signal and a backend processor that performs spatial or temporal scaling of video data by the video data based on the identification information and, when the format of the video is switched, switches parameters for the scaling adjusting to switching timing.

CROSS-REFERENCE TO RELATED APPLICATION

The present application is a continuation of U.S. patent applicationSer. No. 13/142,336, filed on Aug. 19, 2011, which application is anational phase entry under 35 U.S.C. § 371 of International ApplicationNo. PCT/JP2010/069154 filed Oct. 28, 2010, published on May 12, 2011 asWO2011055674 A1, which claims priority from Japanese Patent ApplicationNo. JP 2010-022240 filed in the Japanese Patent Office on Feb. 3, 2010,which claims priority from Japanese Patent Application No. JP2009-254471 filed in the Japanese Patent Office on Nov. 5, 2009, all ofwhich are incorporated herein by reference.

TECHNICAL FIELD

The present invention relates to a receiver, a transmitter, acommunication system, a display control method, a program, and a datastructure.

BACKGROUND ART

In the related art, various systems to display stereoscopic video havebeen known. For example, as described in Patent Literatures 1 to 3, amethod of alternately supplying a left-eye image and a right-eye imagewith a parallax therebetween to a display in a predetermined cycle andobserving these images by using glasses including a liquid crystalshutter driven in synchronization with the predetermined cycle has beenknown.

CITATION LIST Patent Literature

Patent Literature 1: JP 9-138384A

Patent Literature 2: JP 2000-36969A

Patent Literature 3: JP 2003-45343A

Patent Literature 4: US Patent Application Laid-Open No. 2009/092335

Patent Literature 5: US Patent Application Laid-Open No. 2009/096864

SUMMARY OF INVENTION Technical Problem

When it is assumed that content such as a TV program is transmitted froma broadcasting station to a user's TV set, transmitting by switching 3Dvideo and 2D video as video data can be considered. For example, a casewhen the main portion of a program is transmitted as 3D video andcommercials as 2D video can be considered.

Various modes such as the side-by-side mode, top and bottom mode, andfield sequential mode are known as 3D video modes. Thus, it is assumedthat video data transmitted from the side of a broadcasting station istransmitted after being dynamically switched between various methodsadjusting to video content.

In such a case, it is assumed that the TV set on the user side may notbe able to display an image properly immediately after switching. Thisis because various parameters (such as the image size and frequency)concerning video before and after switching change and thus, it becomesnecessary to switching display processing on the side of the TV set. Todisplay images properly immediately after switching, it is necessary tocommunicate details including timing of format switching to thereceiver, but the technology in the related art in Patent Literatures 4and 5 does not assume such a mechanism.

The present invention has been made in view of the above issue and it isdesirable to provide a receiver capable of realizing the proper displayimmediately after switching when a video signal including stereoscopicvideo is switched, a transmitter, a communication system, a displaycontrol method, a program, and a data structure.

Solution to Problem

According to an aspect of the present invention to solve the aboveissue, there is provided a receiver including a decode processing unitthat decodes a video signal received from outside; an identificationinformation acquisition unit that acquires identification informationincluding a format concerning 2D or 3D of video from a layercorresponding to each frame of the video signal; and a processing unitthat performs processing for an image display by the video signal basedon the identification signal.

The processing unit may include a scaling unit that performs spatial ortemporal scaling of video data by the video data based on theidentification information and, when the format of the video isswitched, switches parameters for the scaling adjusting to switchingtiming.

The identification information may contain offset information indicatingthe timing when the switching of the 3D format of the video occurs orthe timing when the switching of the 3D video and the 2D video occursand the scaling unit may start, based on the offset information, theprocessing to switch the parameters before the timing when the switchingoccurs.

The identification information may contain offset information indicatingthe timing when the switching of the 3D format of the video occurs orthe timing when the switching of the 3D video and the 2D video occursand the scaling unit may start, based on the offset information, theprocessing to switch settings of a temporary buffer to hold the decodedvideo signal before the timing when the switching occurs.

The identification information may contain information representing atleast one of a side-by-side mode, a top and bottom mode, and a framesequential mode as the 3D format.

The identification information may contain information indicatingwhether spatial or temporal phases of two views for a left eye and aright eye according to the 3D format have same phases or differentphases.

The scaling unit may perform, as the spatial scaling, the processing toexpand the video decoded by the decode processing unit in a screenvertical direction or a screen horizontal direction.

The scaling unit may perform, as the temporal scaling, copying orinterpolation processing of video frames arranged chronologically.

The identification information may contain information to prohibit aconversion of the format of the video.

According to another aspect of the present invention to solve the aboveissue, there is provided a transmitter including a code processing unitthat encodes a video signal, an identification information insertionunit that inserts identification information including at least a formatconcerning 2D or 3D of video into a layer corresponding to each frame ofthe video signal, and a transmitting unit that transmits the videosignal into which the identification information is inserted.

According to another aspect of the present invention to solve the aboveissue, there is provided a communication system including a transmitterhaving a code processing unit that encodes a video signal, anidentification information insertion unit that inserts identificationinformation including a format concerning 2D or 3D of video into a layercorresponding to each frame of the video signal, and a transmitting unitthat transmits the video signal into which the identificationinformation is inserted, a decode processing unit that decodes the videosignal received from the transmitter, and a receiver having anidentification information acquisition unit that acquires theidentification information including the format concerning 2D or 3D ofthe video from the layer corresponding to each frame of the video signaland a processing unit that performs processing for an image display bythe video signal based on the identification signal.

According to another aspect of the present invention to solve the aboveissue, there is provided an image display method including decoding avideo signal received from outside, acquiring identification informationincluding a format concerning 2D or 3D of video from a layercorresponding to each frame of the video signal, and performingprocessing for an image display by the video signal based on theidentification signal.

According to another aspect of the present invention to solve the aboveissue, there is provided a program causing a computer to function as aunit to decode a video signal received from outside, to acquireidentification information including a format concerning 2D or 3D ofvideo from a layer corresponding to each frame of the video signal, andto perform processing for an image display by the video signal based onthe identification signal.

According to another aspect of the present invention to solve the aboveissue, there is provided a data structure of a digital broadcastingsignal containing a video signal related to content to be broadcast,wherein identification information including a format concerning 2D or3D of video is inserted into a layer corresponding to each frame of thevideo signal and a receiver is caused to function as a unit to performprocessing for an image display by the video signal.

Advantageous Effects of Invention

According to the present invention, when a video signal includingstereoscopic video is switched, a proper display can be realizedimmediately after the switching.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a schematic diagram showing a configuration example of areceiver according to an embodiment of the present invention.

FIG. 2 is a schematic diagram showing an example in which switching of3D_Type occurs.

FIG. 3 is a schematic diagram providing an overview of information addedto a stream syntax of a codec layer of each frame.

FIG. 4 is a schematic diagram showing details of information added tothe stream syntax of the codec layer of each frame.

FIG. 5 is a schematic diagram illustrating details of a 3D type.

FIG. 6 is a schematic diagram illustrating details of the 3D type.

FIG. 7 is a schematic diagram showing other information added to thesyntax.

FIG. 8 is a schematic diagram showing a case when the video is switchedfrom 3D to 2D.

FIG. 9 is a schematic diagram showing a case when the video is switchedfrom 3D to 2D and then, the video is switched from 2D to 3D.

FIG. 10 is a schematic diagram showing parameters before and afterswitching in a Backend Processor.

FIG. 11 is a schematic diagram illustrating a concept of view copy.

FIG. 12 is a schematic diagram illustrating changes of memory settings.

FIG. 13 is a schematic diagram showing the configuration of atransmitter according to the present embodiment.

FIG. 14 is a schematic diagram showing frames corresponding to the 3Dtype when 3D data is an interlaced type of a frame based fieldsequential method.

FIG. 15 is a schematic diagram showing frames corresponding to the 3Dtype when 3D data is of an interlaced type of the field sequentialmethod.

FIG. 16 is a schematic diagram showing frames corresponding to the 3Dtype when 3D data is in side-by-side mode.

FIG. 17 is a schematic diagram showing frames corresponding to the 3Dtype when 3D data is in top and bottom mode.

FIG. 18 is a schematic diagram showing an operation of the receiverbased on a conversion prohibition flag.

FIG. 19 is a schematic diagram showing the operation of the receiverbased on the conversion prohibition flag.

FIG. 20 is a schematic diagram showing the operation of the receiverbased on the conversion prohibition flag.

DESCRIPTION OF EMBODIMENT

Hereinafter, preferred embodiments of the present invention will bedescribed in detail with reference to the appended drawings. Note that,in this specification and the drawings, elements that have substantiallythe same function and structure are denoted with the same referencesigns, and repeated explanation is omitted.

The description will be provided in the order shown below:

-   -   (1) Configuration example of the receiver    -   (2) Example of switching from the 3D type    -   (3) Information added to the codec layer    -   (4) 3D type switching    -   (5) Parameter before and after switching in the Backend        Processor    -   (6) Concept of view copy    -   (7) Changes of memory settings    -   (8) Configuration example of the transmitter        [(1) Configuration Example of the Receiver]

First, an outline configuration of a receiver 100 according to thepresent embodiment will be described based on drawings. FIG. 1 is aschematic diagram showing a configuration example of the receiver 100.As an example, the receiver 100 is a TV set that receives digitalterrestrial broadcasting or the like and receives a bit stream of, forexample, stereoscopic video (3D video) composed of left-eye video andright-eye video and decodes the bit stream before causing a displaypanel to display the bit stream. The receiver 100 also receives a bitstream of normal two-dimensional video (2D video) and combines the bitstream before causing the display panel to display the bit stream.

A transmitter according to the present embodiment transmits byappropriately switching 3D and 2D video in accordance with videosources. For example, the main portion of a program is transmitted as 3Dvideo and commercials inserted therebetween is transmitted as 2D video.When transmitting 3D video, the transmitter may transmit byappropriately switching 3D video of different formats (modes). 3D videoformats include, for example, the side-by-side mode, top and bottommode, and frame sequential mode.

Thus, it becomes necessary for the receiver 100 side to switch receptionprocessing statically or dynamically in accordance with video switchingfrom the transmitter. Thus, in the present embodiment, when dynamicallyswitching between 3D formats or between 3D and 2D, a switching format isinserted into the stream syntax of the layer of codec and also aparameter of the time offset before switching is inserted. Accordingly,sufficient temporal preparations can be secured for the processingsystem of an image processor on the side of the transmitter 100 fortarget timing of switching.

If, for example, dynamically switched between 3D and 2D, an extra memoryis necessary. Moreover, when switching occurs, it may become necessaryto release a memory. As will be described later, it may take time inunits of seconds to release or initialize a memory.

In such a case, if time information before switching is input using acontainer layer such as the time stamp (TS), the time informationdepends on the time stamp and thus, it is difficult to directlyassociate the direct time relationship with V synchronization of video.Furthermore, the time stamp may not be attached to all pictures andthus, it is difficult to control switching with precision insynchronization with video frames.

Thus, according to the present embodiment, the parameter of time offsetbefore switching is basically inserted into a position corresponding tothe picture header in the layer of codec. Accordingly, a parameter isinserted into each frame of a video signal so that the timing of thevideo signal can be processed in units of frame. The timing and positionof a left-eye image L and a right-eye image R are inserted as otherparameters. Accordingly, decoded pictures can be displayed in thecorrect order. In the description that follows, it is assumed that SEI(Supplemental enhancement information: user data) is defined in theMPEG-4 AVC (H.264/MPEG-4 AVC) standard and is inserted into a pictureunit equivalent position. Accordingly, identification information isinserted into each frame and a direct relationship with a 3D videoformat is maintained so that switching timing in the video cycle caneasily be managed. The above information is inserted into the pictureheader of a bit stream, but in the case of, for example, MPEG-4 AVC, theinformation may additionally be inserted into sequence parameters(sequence header), GOP unit equivalent picture parameters or the like.Accordingly, parameters like the time offset can be acquired withoutviewing information in a higher layer. The above content also applieswhen the codec is not only MPEG4 AVC, but also MPEG2 video, VC-1, orMPEG4 Visual.

As will be described later, 2D content and 3D content are mixed in onestream in the present embodiment and when 2D is transmitted after beingconverted into a 3D format, a flag is provided so that the display sidecan identify a difference whether the format before the conversion is 2Dor 3D.

As shown in FIG. 1, the receiver 100 includes a header parser 102, anelementary decoder 104, a backend processor 106, and a system control110. The configuration shown in FIG. 1 can be set up by a circuit(hardware) or a CPU (central processing unit) and a program (software)to cause the circuit or CPU to function. In this case, the program canbe stored in a memory included in the receiver 100 or an externalstorage medium.

In FIG. 1, the header parser 102 detects that 3D is contained in astream received from the transmitter and notifies the system control 110of the detection. The elementary decoder 104 decodes video data of thestream received from the transmitter and outputs the decoded video datato the backend processor 106. The elementary decoder 104 also detectsinformation of 3D_Type in the video stream and notifies the systemcontrol 110 of the image size, frame structure, and frame frequencyalong with Picture_type information. If the video data has a time stamp,the elementary decoder 104 adds the time stamp to the above informationbefore a notification is made to the system control 110.

The system control 110 specifies pictures to be displayed from theoutput buffer of the decoder in the order of time stamp or in accordancewith Picture_Type and makes a notification thereof to the backendprocessor 106. In accordance with the notification from the systemcontrol 110, the backend processor 106 sets the output order of eachpicture fetched from the buffer in which decoded data is accumulated andoutputs the output order to the subsequent stage. If, for example, Bpictures of MPEG-4 AVC are decoded, a re-order occurs and the backendprocessor 106 specifies the output order of each frame underinstructions of the system control 110.

At this point, the system control 110 notifies the backend processor 106of information of 3D_Type, image size, and frame structure of the videodata and also specifies necessary spatial scaling or temporal processingsuch as deinterlacing. Then, during the count-down by FrameCountDown,the system control 110 specifies the time (number of frames) beforeswitching in accordance with the value thereof and the newly switched3D_Type.

Preparations are made inside the backend processor 106 so thatprocessing can be switched smoothly in the timing when 3D_Type switchingoccurs and the processing system is switched in synchronization with theswitching point. As will be described in detail later, the backendprocessor 106 mainly performs spatial and temporal scaling processing inaccordance with 3D_Type and 2D_Type and prepares the memoryconfiguration so that processing can be switched smoothly.

As processing specific to 3D video, the backend processor 106interleaves image data of the left-eye image L and the right-eye image Rafter being decoded in the time direction to make the image datasuitable for a 3D display system. Alternatively, the backend processor106 interleaves the left-eye image L and the right-eye image R in thedirection perpendicular to the screen for each line to make the imagedata suitable for a 3D display system. Both display methods require 3Dglasses, but as is publicly known, display devices with naked eyesalready exist.

[(2) Example of Switching from the 3D Type]

FIG. 2 is a schematic diagram showing an example in which switching of3D_Type occurs and shows a case when 3D_Type changes from the top andbottom mode to the frame sequential mode. FIG. 2 shows how frames changefrom #1 to #7 chronologically. Each frame has information of the framecount-down value (FrameCountDown), 3D type reset flag(3D_Type_Reset_flag), 3D type (3D_Type), and next 3D type afterswitching (Next_3D_type_after_reset) added to the stream syntax of thecoded layer. Thus, the receiver 100 acquires these information from eachframe.

The frame count-down value is a value indicating the number of framesbefore switching. In the example in FIG. 2, switching is not specifiedfor frame #1 and the frame count-down value for frame #1 is set to 0xFF.Thus, when data of #1 is received, no switching is recognized. The nextframe #2 has the count-down value of 2. This shows that switching willoccur when two frames pass after frame #2. The frame count-down valuedecreases by 1 in the subsequent frames #3 and #4 and becomes 0 in frame#4 immediately before switching.

The 3D type reset flag is a flag indicating that the 3D type is reset.If the 3D type reset flag is set to 0, this indicates that the 3D typeis not reset and the same 3D is set in the next frame. On the otherhand, if the 3D type reset flag is set to 1, the 3D type is reset in theframe to end the 3D type up to the current frame and a different 3D typeis set in the next frame.

The 3D type is information indicating the 3D type of the current frame.In frame #1, the 3D type is 000100 and this 3D type indicates that, aswill be described later, the 3D mode of the current frame is the top andbottom mode. In frame #5 after the 3D type is switched, the 3D type is000000 and this 3D type indicates that the 3D mode of the current frameis the 2D mode in the related art.

The next frame type is information indicating the frame type after areset. If the 3D type is reset, the 3D type after switching is shown inthe next frame type. If, for example, the count-down value is set to 2in frame #2, the frame is reset after two frames and 000000 indicatingthe 2D in the related art is specified as the next frame type. In thestage of frame #1, the frame count-down value is 0x00 and whether the 3Dtype will be switched is not indicated and thus, the next frame type isindicated as xxxx.

Therefore, in the example in FIG. 2, data is transmitted in top andbottom mode in frame #1 without indicating whether the 3D type will beswitched. In frame #2, the frame count-down value is set to 2, whichindicates that the current 3D type (top and bottom mode) is reset aftertwo frames and switched to the frame sequential mode, which is the nextframe type. In frame #3, the frame count-down value is set to 1, whichindicates that the current 3D type is reset in the next frame andswitched to the 2D in the related art, which is the next frame type.Then, in frame #4, the frame count-down value is set to 0 and the 3Dtype reset flag is set to 1, which indicate that the current 3D type isreset in the current frame and from the next frame onward, the 3D typeis switched to the 2D in the related art, which is the next frame type.

In frame #5, the 3D type is set to 000000, which indicates that 3D dataof the current frame is in 2D mode in the related art. Moreover, theframe count-down value is set to 0xFF and the 3D type reset flag is setto 0 and thus, whether the 3D type will be switched is not indicated.Because no occurrence of switching is indicated, the next frame type isindicated as xxxxxx.

In the example in FIG. 2, as described above, information of the framecount-down value, 3D type reset flag, 3D type, and next 3D type is addedto the stream syntax of codec layer. Therefore, the receiver 100 canacquire the 3D type of the current frame and whether switching of the 3Dtype occurs next and timing up to switching of the 3D type can beacquired in advance. Accordingly, preparations for processing of thescaler in accordance with switching and preparations for changing thememory configuration can be made in advance.

[(3) Identification Information Added to the Codec Layer]

FIG. 3 is a schematic diagram providing an overview of identificationinformation added to the stream syntax of the codec layer of each frame.As shown in FIG. 3, the 8-bit 3D format signaling ID is added to thestream syntax as user data and also the 3D type reset flag (1 bit),conversion prohibition flag (1 bit), reserved bits (6 bits), framecount-down value (Frame_Countdown_to_reset, 8 bits), 3D type (6bits),view copy (View_copy, 1 bit), L_First_flag (1 bit), next 3D type (6bits), and Reference_Dependency (2 bits).

The 3D format signaling ID is generally user data and corresponds, inMPEG4-AVC, to the user definition of SEI. Each of the above data can beadded in the same manner as buffering period SEI and picture timing SEIdefined in MPEG and is added to each of all frames. The information willbe described in detail below.

FIGS. 4, 5, 6, and 7 are schematic diagrams showing details ofinformation added to the stream syntax of the codec layer of each frame.If, as shown in FIG. 4, the 3D type reset flag is 1, the current 3D typeis reset in the current frame. If the 3D type reset flag is 0, thecurrent 3D type is maintained. The frame count-down value indicates thetime offset value up to the 3D type reset and when the value thereofbecomes 0x00, the 3D type reset flag is set to 1. If the frame is notreset immediately in the future and the frame count-down value is notused, the value thereof is set to 0xFF.

FIGS. 5 and 6 are schematic diagrams illustrating details of the 3Dtype. As shown in FIGS. 5 and 6, in addition to the side-by-side modeand frame sequential mode shown in FIG. 2, various modes can bespecified as 3D type information. As will be described below, 100000 and100001 indicate a case when 3D data is progressive data in fieldsequential mode. 100000 indicates a 1st view and 100001 indicates a 2ndview. Basically, one of the 1st view and the 2nd view is the left-eyeimage L and the other is the right-eye image R. Which of the 1st viewand the 2nd view corresponds to the left-eye image L or the right-eyeimage R is specified by L_First_Flag.

110000, 110001, 110010, and 110011 indicate a case when 3D data isinterlaced data in frame-based field sequential mode. 11000 indicatesthat the top field is 1st view of 1st field of a frame pair and thebottom field is 2nd view of 1st field. 110000 corresponds to FIG. 14A.110001 indicates that the top field is 1st view of 2nd field of a framepair and the bottom field is 2nd view of 2nd field. 110001 correspondsto FIG. 14B. 110010 indicates that the top field is 1st view of 1stfield of a frame pair and the bottom field is 2nd view of 2nd field.110010 corresponds to FIG. 14C. 110011 indicates that the top field is1st view of 2nd field of a frame pair and the bottom field is 2nd viewof 1st field. 110011 corresponds to FIG. 14D.

010000 and 010001 indicate a case when 3D data is in field sequentialmode and 010000 indicate 1st view of 1st field. 010001 indicates 2ndview of 1st field.

010010 and 010011 indicate a case when 3D data is in field sequentialmode and 010000 indicate 1st view of 2nd field. 010011 indicates 2ndview of 2nd field. 010000, 010001, 010010, and 010011 correspond toFIGS. 15A, 15B, 15C, and 15D, respectively.

Thus, in the method of allocating 1st view/2nd view to the topfield/bottom field in field sequential mode, whether to adopt fields ofthe same timing or fields of different timing of respective sourceimages of two views of 1st view/2nd view is made specifiable. Thisenables an adaptive configuration in accordance with parallaxcharacteristics of source images. Here, there are cases when 1st viewadopts the top view of a source image and 2nd view also adopts the topfield of the source image and when 1st view adopts the top view of thesource image and 2nd view adopts the bottom field of the source image.

001000 and 001001 indicate the side-by-side mode and also indicate thatthe left side is 1st view and the right side is 2nd view. Of these,001000 indicates that the left side and the right side have the samesampling position and the sampling position is in even pixels. 001001indicates that the left side and the right side have the same samplingposition and the sampling position is in odd pixels. 001000 and 001001correspond to FIGS. 16A and 16B, respectively. 001010 and 001011indicate the side-by-side mode and also indicate that the left side is1st view and the right side is 2nd view. Of these, 001010 indicates thatthe left side and the right side have alternative sampling positions and1st view is in even pixels and 2nd view is in odd pixels. 001011indicates that the left side and the right side have alternativesampling positions and 1st view is in odd pixels and 2nd view is in evenpixels. 001000 and 001001 correspond to FIGS. 16C and 16D, respectively.

000100 and 000101 indicate the top and bottom mode and also indicatethat the upper side is 1st view and the lower side is 2nd view. Ofthese, 000100 indicates that the upper side and the lower side have thesame sampling position and 1st view and 2nd view are both sampled ineven lines. 000101 indicates that the upper side and the lower side havealternative sampling positions and 1st view is sampled in even lines and2nd view is sampled in odd lines. 000100 and 000101 correspond to FIGS.17A and 17B, respectively.

000110 and 000111 indicate the top and bottom mode and also indicatethat the upper side is 1st view and the lower side is 2nd view. Ofthese, 000110 indicates that the upper side and the lower side havealternative sampling positions and 1st view is sampled in odd lines and2nd view is sampled in even lines. 000111 indicates that the upper sideand the lower side have alternative sampling positions and 1st view and2nd view are both sampled in odd lines. 000110 and 000111 correspond toFIGS. 17C and 17D, respectively.

By deciding the 3D type as described above, when 1st view and 2nd vieware interleaved, whether to allocate the same temporal phase ordifferent phases to the two views can explicitly be shown. By adoptingdifferent temporal phases, temporal precision of quality in accordancewith the parallax can be improved with different phases whenchronologically interleaving in frame sequential or field sequentialmode.

Moreover, in spatial phases, particularly sub-samples in the horizontaldirection, whether to allocate the same sample phase or different phasescan explicitly be shown. Accordingly, for example, when spatiallyinterleaving two views in side-by-side mode, horizontal spatialresolution in a portion with less parallax can be improved by adoptingdifferent spatial phases. Also, in sub-samples in the verticaldirection, whether to allocate the same sample phase or different phasescan explicitly be shown. Accordingly, when spatially interleaving twoviews in top and bottom mode, vertical spatial resolution in a portionwith less parallax can be improved by adopting different spatial phases.According to 3D type settings as described above, a quality improvementeffect can be realized by selectively specifying and coding the 3D typewhen transmitting each format.

If the 3D type is 000000, no 3D format is specified, indicating thatvideo data is 2D. If information of the 3D type (6bits) shown in FIG. 4is not added, the receiver 100 handles the data as 2D data.

According to the 3D type information as described above, whether theframe sequential, field sequential, side-by-side, top and bottom, or 2Das the 3D format can be judged by checking the bit where the first 1 of6 bits is set. Incidentally, the receiving side may reject any other 3Dtype than the above types.

Information of view copy shown in FIG. 4 is information indicatingwhether a 3D frame pair (1st view, 2nd view) has been created bycopying. If the view copy is 1, this indicates that 2nd view is a copyof 1st view. In this case, 1st view and 2nd view are the same and the 3Dframe pair is handled as 2D data on the side of the receiver 100. If theview copy is 0, this indicates that 1st view and 2nd view areindependent data and in this case, the receiver side can recognize thatthe 3D frame pair is 3D data.

FIG. 7 is a schematic diagram showing other information added to thesyntax. L_First_Flag is a flag indicating which of the 1st view and the2nd view corresponds to the left-eye image L or the right-eye image R.If L_First_Flag is 1, this indicates that 1st view is the left-eye imageL. If L_First_Flag is 0, this indicates that 1st view is the right-eyeimage R. The next 3D type is, as described above, information thatindicates the 3D type after switching and can be specified in the samemanner as the 3D type.

Reference_Dependency is data indicating whether reference data of motioncompensation of MPEG or the like extends over frames. IfReference_Dependency is 10, a frame of 1st view uses only frames ofother 1st view as reference data and a frame of 1st view does not useany frame of 2nd view as reference data. In this case, no restriction isimposed on 2nd view and a frame of 2nd view can use a frame of 1st viewor other 2nd view as reference data.

If Reference_Dependency is 01, a frame of 2nd view uses only frames ofother 2nd view as reference data and a frame of 2nd view does not useany frame of 1st view as reference data. In this case, no restriction isimposed on 1st view and a frame of 1st view can use a frame of other 1stview or 2nd view as reference data.

If Reference_Dependency is 11, a frame of 1st view uses only frames ofother 1st view as reference data and a frame of 2nd view uses onlyframes of other 2nd view as reference data. If Reference_Dependency is00, there is no restriction of reference data for any view.

With an arrangement of Reference_Dependency as described above,processing can be made more efficient by, for example, decoding only 1stview when a 2D image is displayed with the left-eye image L only ofreceived left and right images by only the left-eye image beingreferenced by 1st view for the left-eye image L.

In the present embodiment, information in FIGS. 3 to 7 is inserted intoSEI of MPEG4 AVC. On the other hand, in other codec standards, theinformation can be inserted into a corresponding location in each frameof a video signal. For example, in MPEG2 video, VC1, or MPEG4 Visual,switching synchronized with the video signal can be realized byinserting insertion into each frame in the layer of a video signal.

For MPEG2 video (H.262 ISO/IEC IS 13818.2 Information Technology—GenericCoding of Moving Picture and Associated Audio Information: Video), forexample, the insertion can be inserted into the User data area of thepicture header defined for the format. Also in this case, data mayfurther be inserted into sequence header, slice header, or macro blockheader.

[(4) 3D Type Switching]

Next, 3D type switching will be described in detail. FIG. 8 is aschematic diagram showing a case when the video is switched from 3D to2D. The upper part of FIG. 8 schematically shows data of each framereceived by a receiver and frames #1 to #6 are transmitted at afrequency of 60 [Hz] as progressive data in frame sequential mode after100000 and 100001 being alternately specified as the 3D type. Then,after data is switched to 2D data in frame #7, 2D data is transmitted at30 Hz. The 3D type is set to 000000 (Non-3D format) for frame #7 andsubsequent frames.

The lower part of FIG. 8 schematically shows data (Decoded Frames) ofeach frame output from the decoder 104 and data (BackEnd Proc out) offrames output from the backend processor 106. The lower part of FIG. 8schematically shows a state when viewed from above frames. Data of eachframe at the frequency of 60 [Hz] is output up to frame #6 from thedecoder 104 and data of frame #7 and subsequent frames is output at thefrequency of 30 [Hz].

The backend processor 106 receives output of the decoder 104 and outputsframes #1 to #6 to the display panel at the frequency of 120 [Hz]. Thebackend processor 108 temporally scales frame #7 and subsequent framesto convert into 120 [Hz] data and outputs the data to the display panel.More specifically, the backend processor 106 performs interpolationprocessing between, for example, frame #7 and frame #8 to generate frame#7-8 and also generates frame #8-9 by performing interpolationprocessing between frame #8 and frame #9. As another method, frame #7may be copied as frame #7-8 and frame #8 may be copied as frame #8-9. Ifframes #7-8 and #8-9 are generated on the side of the transmitter 100 inadvance by view copying, the receiver 100 can display received 2D dataas it is. In the case of view copying, 2D data is transmitted at thefrequency of 120 [Hz].

In the example of FIG. 8, the receiver 100 can acquire switching timingfrom 3D to 2D based on the frame count-down value added to each frame.Thus, the backend processor 108 can recognize switching timing of frame#7 in advance and can make preparations necessary for scaling of 2Ddata. Therefore, when video is switched from 3D to 2D, the display canbe made continuously without causing a time lag.

FIG. 9 is a schematic diagram showing a case when the video is switchedfrom 3D to 2D and then, the video is switched from 2D to 3D. As shown inthe upper part of FIG. 9, frames #1 to #4 have interlaced (60i) data of60 [Hz] with 1920×1080 pixels and the 3D flag of each frame indicates“3D”. Then, when 3D data is switched to 2D data in frame #5, interfaced(60i) data of 60 Hz with 1920×1080 pixels is received. Then, 2D data isswitched to 3D data again in frame #11 to obtain interfaced (60i) dataof 60 Hz with 1920×1080 pixels.

The lower part of FIG. 9 schematically shows data (Decoded Frames) ofeach frame output from the decoder 106 and data (BackEnd Proc out) offrames output from the backend processor 106. The lower part of FIG. 9schematically shows a state when viewed from above frames and 3D data isin side-by-side mode and thus, L, R data is arranged side by side in thehorizontal direction of the screen. Data of each frame in side-by-sidemode at the frequency of 60 [Hz] is output up to frame #4 from thedecoder 104 and 2D data of frames #5 to #10 is output at the frequencyof 60 [Hz]. For frame #11 and subsequent frames, data of each frame inside-by-side mode at the frequency of 60 [Hz] is output.

The backend processor 106 receives output of the decoder 104 and expandseach piece of L, R data (each having 960×1080 pixels) in side-by-sidemode of frames #1 to #4, #10 to #14 by performing filtering(complementary processing) thereon in the screen horizontal direction todata of 1920×1080 pixels and scales the data in the time base directionbefore sending the data at 120 [Hz] to the display panel. The backendprocessor 108 scales (interpolation processing) of 2D frames #5 to #10in the time base direction to convert to 120 [Hz] data and sends thedata to the display panel.

Also in the example of FIG. 9, the receiver 100 can acquire switchingtiming of 3D↔2D from the frame count-down value added to each frame.Thus, the backend processor 108 can recognize switching timing of frame#4 in advance and can make preparations necessary for interpolationprocessing of 2D data. Also, the backend processor 108 can recognizeswitching timing of frame #10 in advance and can make preparationsnecessary for 3D processing. Therefore, when video is switched between3D and 2D, the display can be made continuously without causing a timelag.

In FIG. 8, the left-eye image L, the right-eye image R, or 2D image datain frame sequential mode is stored in different areas of the memoryafter being decoded. Similarly, in FIG. 9, the left-eye image L, theright-eye image R, or 2D image data in side-by-side mode is stored indifferent areas of the memory after being decoded. Thus, memory areasare different before and after switching and it is necessary toinitialize in advance depending on switching timing. In the presentembodiment, the time offset before switching is known in advance fromthe time offset value and preparations for memory initialization can bemade before switching.

In the example in FIG. 9, the algorithm for interpolation processing isdifferent between 3D and 2D in processing by the backend processor 106.Also in this case, the time offset before switching is known in advancefrom the time offset value and preparations for algorithm switching canbe made before switching.

[(5) Parameter Before and After Switching in the Backend Processor]

FIG. 10 is a schematic diagram showing parameters before and afterswitching in the backend processor 108 when the above switching occurs.The first column in FIG. 10 shows parameter changes when video isswitched between 3D and 2D after video data in field sequential (FS)mode being received. In this example, it is assumed that the 3D data has1280×720 pixels and the frequency of left/right video is 30×2=60 [Hz].That is, L, R data is each output from the decoder 104 at the frequencyof 30 [Hz] and the frequency of left/right video is 30×2=60 [Hz]. On theother hand, the 2D data also has 1280×720 pixels, but is output from thedecoder 104 at the frequency of 30 [Hz]. Thus, when switching from 3D to2D occurs, the backend processor 108 performs scaling in the timedirection to perform complementary processing of the 2D data for outputat the frequency of 60 [Hz]. Accordingly, when video is switched between3D and 2D, the video before and after switching can be made to have thesame frequency and processing of the display panel can be performed inthe same manner and also a sense of discomfort felt by the user duringswitching can be reduced to a minimum.

The second column in FIG. 10 shows parameter changes when video isswitched between 3D and 2D after the 3D data in side-by-side (SBS) modebeing received. In this example, it is assumed that each of L, R of the3D data has 960×540 pixels and one frame in which L and R are arrangedin the screen horizontal direction is output from the decoder 104 at thefrequency of 60 [Hz]. On the other hand, the 2D data has 1920×540 pixelsand is output from the decoder 104 at the frequency of 60 [Hz]. Thus,the backend processor 108 performs scaling of each of L, R of the 3Dvideo in the space direction to output data of 1920×540 pixels. Morespecifically, filtering (complementary processing) to increase thenumber of pixels in the screen horizontal direction of a frame to960×2=1920 is performed. On the other hand, both of 3D and 2D have theoutput frequency of 60 [Hz] from the decoder 104 and thus, no scaling isperformed. Accordingly, when video is switched between 3D and 2D, thevideo before and after switching can be made to have the same number ofpixels.

The third column in FIG. 10 shows parameter changes when video isswitched between 3D and 2D after the 3D data in top and bottom (TAB)mode being received. In this example, it is assumed that each of L, R ofthe 3D data has 1920×270 pixels and one frame in which L and R arearranged in the screen horizontal direction is output from the decoder104 at the frequency of 60 [Hz]. On the other hand, the 2D data has1920×540 pixels and is output from the decoder 104 at the frequency of60 [Hz]. Thus, the backend processor 108 performs scaling of each of L,R of the 3D video in the space direction to output data of 1920×540pixels. More specifically, complementary processing to increase thenumber of pixels in the screen vertical direction of a frame to270×2=540 is performed. On the other hand, both of 3D and 2D have theoutput frame frequency of 60 [Hz] from the decoder 104 and thus, noscaling is performed. Accordingly, when video is switched between 3D and2D, the video before and after switching can be made to have the samenumber of pixels.

In the spatial and temporal scaling of image data as described above, itis difficult to display scaled proper images immediately after switchingif parameters after conversion are not known in advance. In the presentembodiment, the frame count-down value and the next 3D type afterconversion can be acquired before the conversion and thus, switchingpreparations for spatial and temporal scaling can be made before theswitching timing. Therefore, properly scaled images can be displayedimmediately after switching.

FIG. 11 is a schematic diagram illustrating the concept of view copy.The view copy is used to copy one of L, R of 3D video as the other tocreate the same video data of L, R on the transmitter side and totransmit the data as 2D data to the receiving side. As shown in FIG. 11,in top and bottom mode, original frame data (Original Frame) iscompressed vertically (Scaling Vertically) and the same compressed datais arranged vertically in each of L, R areas of top and bottom. If, forexample, the original frame is the left-eye image L and the 2D displayis made only with the left-eye image L by view copying, data of theleft-eye image L compressed vertically is arranged in the area of theleft-eye image L of the top and bottom and the left-eye image L iscopied to the area of the right-eye image R (lower area).

In side-by-side mode, original frame data is compressed horizontally(Scaling Horizontally) and the same compressed data is arrangedvertically in each of

L, R areas of side by side. If, for example, the original frame is theleft-eye image L and the 2D display is made only with the left-eye imageL by view copying, data of the left-eye image L compressed horizontallyis arranged in the area of the left-eye image L of the side by side andthe left-eye image L is copied to the area of the right-eye image R(right area).

In frame sequential mode, predetermined scaling (compression) isperformed on the original frame data and then, data of a frame N, whichis the left-eye image L, to the next frame N+1 in the time basedirection. Accordingly, while the original frame N+1 of 3D has theright-eye image R, the left and right videos become the same by copyingthe left-eye image L and the 2D video can be displayed with the left-eyeimage L only.

If View_Copy described in FIG. 4 is 1, 2nd view is a copy of 1st viewand a view copy as shown in FIG. 11 is made. Thus, when 2D data isreceived, the receiver 100 can recognize that the 2D data is originally3D data and is generated by copying one of L, R of original data to theother by referencing data of View_Copy.

[(7) Changes of Memory Settings]

FIG. 12 is a schematic diagram illustrating changes of memory settings.As shown in the upper part of FIG. 12, a de-interlacer 106 a, a scaler106 b, and a frame interleaver 106 c are arranged in the subsequentstage of the decoder 104 and the function block is mainly configured bythe backend processor 106. The lower part of FIG. 12 shows output fromthe decoder 104 and each of cases when video data has interlaced framesand progressive frames is shown. For interlaced frames, an (N−1)-thframe and an N-th frame are composed of a frame pair of Even Field andOdd Field and L, R data of one frame is composed of a frame pair. Forprogressive frames, on the other hand, the number of frames is doublethat of interlaced frames and a 2*N-th frame and a (2*N+1)-th frame areconfigured by L, R data being arranged in one frame.

A memory necessary to hold these data is allocated to a memory space bythe system control 110. The necessary memory configuration changesdepending on 3D_type and the controller initializes memory allocationwhen 3D_Type is switched. The memory is frequently initialized in timingdependent on the operating system and the timing necessary forinitialization may be in the range of 10 ms to several tens of ms. Onthe other hand, filter switching by the de-interlacer 106 a or thescaler 106 b is performed after normal processing being temporarilystopped.

In view of these factors, a switching preparation period ranging from aplurality of video frames or a plurality of video vertical periods toseveral seconds for 3D_Type switching is necessary. Thus, it isnecessary to transmit information about a time difference up toswitching timing of 3D_Type from the transmitting side to the receivingside to cause the display panel to display new 3D_Type after switchingfrom the initial video frame onward.

In the present embodiment, if switching occurs as described above, theswitching timing is indicated in advance by the frame count-down valuebefore the switching timing. Thus, by making preparations for memoryinitialization in advance, the receiver 100 can cause the display panelto display new 3D_Type after switching from the video frame immediatelyafter switching onward.

[(8) Operation of Conversion Prohibition Based on the ConversionProhibition Flag]

Next, the operation of conversion prohibition based on the conversionprohibition flag will be described. As described in FIG. 3,identification information added to the stream syntax of the codec layerof each frame contains the conversion prohibition flag(Prohibit_Convert_flag). The conversion prohibition flag is set to “1”when content transmitted from the transmitting side should be prohibitedfrom being converted according to the intention of the transmittingside. On the other hand, the conversion is permitted according to theintention of the transmitting side, the conversion prohibition flag isset to “0”.

Like a case when, for example, the transmitted content is premium 3Dvideo, content that is not assumed to be displayed as 2D video on thetransmitting side may be delivered. In such a case, conversion to 2D bythe receiver 100 is prohibited by setting the conversion prohibitionflag to “1” on the transmitting side. Similarly, if transmitted contentis 2D video and conversion to 3D video is not assumed on thetransmitting side, conversion to 3D by the receiver 100 is prohibited bysetting the conversion prohibition flag to “1” on the transmitting side.

FIGS. 18, 19, and 20 are schematic diagrams showing the operation of thereceiver 100 based on the conversion prohibition flag. FIGS. 18 and 19show a case when content transmitted from the transmitting side is 3Dvideo, FIG. 18 shows a case of the frame sequential mode, and FIG. 19 isa case of the side-by-side mode. FIG. 20 shows a case when thetransmitted content is 2D video.

In the field sequential mode shown in FIG. 18, like in FIG. 8, a frameof the left-eye image L and a frame of the right-eye image R arealternately output frame sequentially from the decoder 106. If, as shownin FIG. 18, the conversion prohibition flag is “0”([i]Prohibit_Convert_flag=0), the conversion is permitted and thereceiver 100 can normally 3D-play back the received content (normal 3Dplayback). If the conversion prohibition flag is “0”, the receiver 100can also convert 3D video into 2D video to play back 2D converted video(2D convert playback). For normal 3D playback, the backend processor 106performs predetermined 3D processing on data (Decoded Frames shown inFIG. 18) output from the decoder 104 and outputs as data of 3D Proc outshown in FIG. 18. In this case, the backend processor 106 can outputwithout performing scaling in the space direction or time direction. For2D convert playback, only the left-eye image L is extracted from data(Decoded Frames shown in FIG. 18) output from the decoder 104 and outputby the backend processor 106. Thus, 3D video can be converted into 2Dvideo with only the left-eye image L and then played back. Incidentally,for 2D convert playback, only frames of the right-eye image R may beextracted to convert 3D video into 2D video with only the right-eyeimage R for playback.

On the other hand, if the conversion prohibition flag is “1”([ii]Prohibit_Convert_flag=1) in FIG. 18, the conversion is prohibitedand the receiver 100 can normally 3D-play back the received content(normal 3D playback), but is not permitted to convert the 3D video into2D video. Thus, the conversion from 3D video into 2D video does notfunction in the receiver 100. 3D playback is the same as when theconversion prohibition flag is “0”.

In the side-by-side mode shown in FIG. 19, image data in which theleft-eye image L and the right-eye image R are put together in one frameis output from the decoder 104. If, as shown in FIG. 19, the conversionprohibition flag is “0” ([i]Prohibit_Convert_flag=0), the conversion ispermitted and the receiver 100 can normally 3D-play back the receivedcontent (normal 3D playback). If the conversion prohibition flag is “0”,the receiver 100 can also convert 3D video into 2D video to play back 2Dconverted video (2D convert playback). For normal 3D playback, scalingprocessing (interpolation processing) is horizontally performed on leftand right images of data (Decoded Frames shown in FIG. 19) output fromthe decoder 104 by the backend processor 106 and the left-eye image Land the right-eye image R are alternately output for each framechronologically as 3D Proc out data shown in FIG. 19. For 2D convertplayback, scaling processing is performed on only the left-eye image Lof data (Decoded Frames shown in FIG. 19) output from the decoder 104 bythe backend processor 106 and only the left-eye image L is output ineach frame. Thus, 3D video can be converted into 2D video with only theleft-eye image L and then played back. Incidentally, for 2D convertplayback, scaling processing may be horizontally performed only on theright-eye image R to convert 3D video into 2D video with only theright-eye image R for playback.

On the other hand, if the conversion prohibition flag is “1”([ii]Prohibit_Convert_flag=1) in FIG. 19, the conversion is prohibitedand the receiver 100 can normally 3D-play back the received content(normal 3D playback), but is not permitted to convert the 3D video into2D video. Thus, the conversion from 3D video into 2D video does notfunction in the receiver 100. 3D playback is the same as when theconversion prohibition flag is “0”.

If, as shown in FIG. 20, the transmitted content is 2D video and theconversion prohibition flag is “0” ([i]Prohibit_Convert_flag=0), theconversion is permitted and the receiver 100 can normally 2D-play backthe received content (normal 2D playback). If the conversion prohibitionflag is “0”, the receiver 100 can also convert 2D video into 3D video toplay back 3D converted video (3D convert playback). For normal 2Dplayback, 3D processing is not performed on data for the left-eye imageL (Decoded Frames shown in FIG. 20) output from the decoder 104 by thebackend processor 106 and the data is output as 2D data frames.Accordingly, 2D video with the left-eye image L can be displayed. If 2Dvideo with data of the right-eye image R is transmitted from thetransmitting side, 2D video with the right-eye image R can be displayed.

When 2D Convert playback is performed in FIG. 20, processing to shiftvideo by the parallax is performed on data (Decoded Frames shown in FIG.20) of the left-eye image L output from the decoder 104 by the backendprocessor 106. With this processing, the right-eye image R to be pairedwith the original left-eye image L is generated so that the left-eyeimage L and the right-eye image R are alternately output. Whenprocessing to shift the left-eye image L by the parallax is performed,more realistic 3D video can be generated by making the amount ofparallax different from object to object contained in the left-eye imageL. In this case, various methods can be used to detect objects such as amethod of detecting contour edges and a method of detecting differenceof brightness or contrast.

On the other hand, if the conversion prohibition flag is “1”([ii]Prohibit_Convert_flag=1) in FIG. 20, the conversion is prohibitedand the receiver 100 can normally 2D-play back the received content(normal 2D playback), but is not permitted to convert the 2D video into3D video. Thus, the conversion from 2D video into 3D video does notfunction in the receiver 100. 2D playback is the same as when theconversion prohibition flag is “0”.

[(8) Configuration Example of the Transmitter]

FIG. 13 is a schematic diagram showing the configuration of atransmitter 200 according to the present embodiment. The transmitter 200is used to encode and transmit 3D video or 2D video to the receiver 100via a wire or wireless transmission path.

As shown in FIG. 13, the transmitter 200 includes a CPU 202, a counter204, a switch 206, and a video encoder 208. A database 210 and adatabase 212 are connected to the switch 206. The database 210 and thedatabase 212 may be configured integrally with the transmitter 200 orseparately. The configuration shown in FIG. 13 can be set up by acircuit (hardware) or a CPU (central processing unit) and a program(software) to cause the circuit or CPU to function. In this case, theprogram can be stored in a memory included in the receiver 100 or anexternal storage medium.

The database 210 has data of 3D video A stored therein. The database 212has data of 2D video B stored therein. Data of the video A stored in thedatabase 210 and data of the video B stored in the database 212 aretransmitted to the switch 206. The switch 206 selects one of the video Aand the video B and sends the selected video to the video encoder 208.The switch 206 sends one of the video A and the video B to the videoencoder 208 in accordance with the count number of frames by the counter204.

The counter 204 counts the number of frames of video data transmitted tothe video encoder 208 based on instructions from the CPU 202 and sendsthe count number up to switching to the switch 206. If, for example, the3D video A is data of the main portion of a program and the 2D video Bis commercial data, the CPU 202 sends the count number of framescorresponding to the time offset up to switching from the video A of themain portion of the program to the video B of the commercial to theswitch 206.

Based on the count number sent from the counter 204, the switch 206switches the video sent to the video encoder 208 in timing in whichswitching occurs. Accordingly, the video can mutually be switchedbetween the video A of the main portion of a program and the video B ofcommercials.

The video encoder 208 encodes data of the video A or the video B sentfrom the switch 206 as a bit stream of MPEG-4 AVC, MPEG-2 Video, VC-1,MPEG4 Visua or the like to transmit the data to the receiver 100.

The counter 204 sends the counter number up to switching to the videoencoder 208. The video encoder 208 adds the count number sent from thecounter 204 to the stream syntax of the codec layer of each encodedframe as the frame count-down value. Based on instructions from the CPU202, the video encoder 208 adds various kinds of data described withreference to FIGS. 3 to 7 to data of the video A or the video B. The bitstream encoded by the video encoder 208 is transmitted from atransmitting unit 214 to the receiver 100.

According to the above configuration, the transmitter 200 can switch the3D video A and the 2D video B based on instructions of the CPU 202. Thetransmitter 200 can add the count-down value indicating the number offrames up to switching to video data based on instructions of the CPU202. The transmitter 200 can also add various kinds of information inFIGS. 3 to 7 to video data. Therefore, based on these information addedby the transmitter 200, the receiver 100 side can perform processingbefore and after switching and make preparations for various kinds ofprocessing concerning switching in advance.

The preferred embodiments of the present invention have been describedabove with reference to the accompanying drawings, whilst the presentinvention is not limited to the above examples, of course. A personskilled in the art may find various alternations and modificationswithin the scope of the appended claims, and it should be understoodthat they will naturally come under the technical scope of the presentinvention.

REFERENCE SIGNS LIST

-   100 Receiver-   104 Decoder-   106 Backend processor-   200 Transmitter-   202 CPU-   208 Video encoder-   206 Framing unit

The invention claimed is:
 1. A video information transmission device,comprising: circuitry configured to: add Supplemental EnhancementInformation (SEI) into a unit of video information, wherein the SEIindicates whether conversion of the video information from a 2D video toa 3D video is prohibited; and transmit the unit with the SEI, whereinthe SEI further indicates at least one of a first time at which a switchof 3D format is to occur or a second time at which a switch between the3D format and 2D format is to occur.
 2. The video informationtransmission device of claim 1, wherein the SEI is within each unit ofthe video information of a video signal.
 3. The video informationtransmission device of claim 2, wherein the SEI is within each unit ofthe video information of the video signal, based on the video signal ofa 3D video that has a frame sequential format.
 4. The video informationtransmission device of claim 1, wherein the SEI further indicates whichof two views corresponds to one of a left eye or a right eye of the 3Dvideo.
 5. The video information transmission device of claim 1, whereinthe SEI further indicates a next 3D video type format after the 3D videois switched.
 6. A video information method, the method comprising:adding Supplemental Enhancement Information (SEI) into a unit of videoinformation, wherein the SEI indicates whether conversion of the videoinformation from a 2D video to a 3D video is prohibited; andtransmitting the unit with the SEI, wherein the SEI further indicates atleast one of a first time at which a switch of 3D format is to occur ora second time at which a switch between the 3D format and 2D format isto occur.
 7. A video information reception device, comprising: circuitryconfigured to: receive a unit of video information along withSupplemental Enhancement Information (SEI), wherein the SEI indicateswhether conversion of the video information from a 2D video to a 3Dvideo is prohibited, and wherein the SEI further indicates at least oneof a first time at which a switch of 3D format is to occur or a secondtime at which a switch between the 3D format and 2D format is to occur;and switch the unit of the video information based on the received SEI.